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DETAILED ACTION 

Claim Rejections - 35 USC § 112 

1 . The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

2. Claims 1,3-48 are rejected under 35 U.S.C. 1 12, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. As per independent claims 1,19,22,27,31,32, the claim recitations pertaining to 
obtaining/segmenting audio signals based upon audio characteristics are vague and indefinite 
because it is not clear as to which segmenting aspect of applicants disclosure this refers. For 
example, applicants disclosure presents two segmenting sections, the first being 1) a t)q)ical 
audio encoder that extracts audio signal information (outputting segments based upon 
voice/unvoiced, silence decision - fig. 4, line 110 into subblock 12, generating segmented audio 
with associated parameters - line 112; and applicants specification, page 13 lines 8-14); and the 
second 2) being the re-segmented sequence of initial segments based upon a degree of voicing, 
etc., derived from speech parameters (fig.4, subblock 20, and applicants spec, page 15, lines 1- 
17). The current claim scope does not distinguish between these two sections of applicants 
disclosure and as such, these claims are rejected under 35 U.S.C. 1 12 second paragraph. For art 
related examination purposes only, examiner will interpret the claim scope to read upon the first 
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section discussed above, namely, the encoder section of Fig. 4 that encompasses only line 110, 
subblock 12, and line 1 12. The dependent claims do not remedy the deficiencies of the 
independent claims, and as such, are also rejected under 35 U.S.C. 1 12, second paragraph. 

Claim Rejections - 35 USC § 102 

3. Claims 1, 3-14, 19-21,26-37, 39-44,46-48 are rejected under 35 U.S.C. 102 (b) as being 
anticipated by Gersho et al. (6,31 1,154). 

As to claim 1, Gersho et al. teach segmenting {partitioning or classifying} the audio 
signal {speech} into a plurality of segments {frames} (partitioning samples of a speech signal 
into frames, col. 4, lines 25-27) based on the audio characteristics {classes} of the audio signal 
(classifying the speech signal in each from into one of a plurality of classes, col. 4, lines 25-27); 
and encoding the segments {frames} with different encoding settings {excitation} (encoding an 
excitation for the frame using one of a pliirality of excitation coding.. .selected according to the 
class of the frame, col. 4, lines 30-33). 

As to claim 3, Gersho et al. teach characteristics {classes/classifying} include voicing 
characteristics {voice} in said segments {frames} of the audio signal { speech signal} 
(classifying the speech signal in each frame into classes, classes include voice frame, col. 4, lines 
25-27 & 35). 

As to claim 4, Gersho et al. teach characteristics {identifying} include energy 
characteristics {presence of energy} in said segments {window} of the audio signal {residual 
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signal } (identifying the location of a window, identifying considers the presence of energy in the 
residual signal, col. 4, lines 65-67). 

As to claim 5, Gersho et al. teaches characteristics {positioning} include pitch 

characteristics {function of the pitch) in said segments {frames} of the audio signal (positioning 
the window at a location that is a function of a pitch of the frame, col. 4, Unes 59-61). 

As to claim 6, Gersho et al. teach segmenting {partitioning} is carried out concurrently 

{classifying and encoding} to said encoding {coding} (partitioning samples of speech, 
classifying speech signals into classes, coding a speech signal, col. 4, lines 24-25. The 
classifying and encoding process may be done concurrently). 

As to claim 7, Gersho et al. teach segmenting is carried out before said encoding 
(partitioning samples of speech, classifying speech signals into classes, coding a speech signal, 
col. 4, lines 24-25, thus the classifying or segmenting is done before coding). 

As to claim 8, Gersho et al. teach plurality of voicing values {voice or unvoiced} are 
assigned to the voicing characteristics of the audio signal in said segments, and wherein said 
Segmenting {partitioning} is carried out based on the assigned voicing values (classifying a 
frame is being one of an unvoiced or voiced, col. 4, lines 52-53). 
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As to claim 9, Gersho et al. teach a value designated {classifying} to a voiced speech 
signal and another value designated to an unvoiced signal (classifying a frame is being one of an 
unvoiced or voiced, col. 4, lines 51-52). 

As to claim 10 Gersho et al. teach a value designated {classifier} to a transitional stage 
between the voice and unvoiced {transitional} signals {frame} (classifier for classifying a 
fransition frame, col. 4, lines 52-55) 

As to claim 1 1, Gersho et al. teach a value designated {(m)=l } to an inactive period 
{silent frame} in the audio signal {speech} (If (m)=l, then the current frame is declared a silent 
frame, col. 15, lines 7-8 & 35-37). 

As to claim 12, Gersho et al. teach selecting a quantization mode for said encoding in 
order to improve the bit allocation and to reduce the parameter update rate, wherein the 
segmenting is carried out based on the selected quantization mode (col. 3 lines 45-49; Fig. 5 and 
col. 11 lines 4-16; col. 4, lines 36-37, col. 15, lines 35-36 & col. 9, lines 63-65). 

As to claim 13, Gersho et al. teach segmenting is carried out based on target accuracy in 
reconstruction of the audio signal, wherein the target accuracy is selected based on distortion 
criteria comparing up-sampled quantized values (fransmitted samples) and modified parameters 
(col. 9, lines 63-65 and col. 3 lines 45-49). 
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As to claim 14, Gersho et al. teach segmenting is carried out for providing a linear pitch 
representation in at least some of said segments (col. 9, lines 63-65; col. 3 lines 45-49 and col. 4 
lines 50-62). 

As to claim 19 and 27, Gersho et al. (154) teach an input for receiving audio data 
indicative of the parameters in the adjusted representation (input applied to element 14, Fig. 3). 
and a module responsive to the audio data for generating the audio signal based on the adjusted 
signals and the characteristics of the audio signal (Fig. 3. One would necessarily need a module 
to respond to an adjusted audio signal/characteristics of audio signals). 

At the time of the invention, it would have been inherent to one of ordinary skill in to use 
a decoder in order to reverse the encoding data for further processing, such as modulating or 
storing the audio signal. 

As to claim 20 and 28, Gersho et al. (154) teaches recording parameters (col. 29 lines 25- 

35); 

As to claim 21 and 29, Gersho et al. (154) teach, 
the audio data is transmitted through a communication channel and wherein the input of the 
decoder is operatively connected to the communication channel for receiving the audio data 
(digital communications, col. 1, line 1 and Fig. 3). 
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As to claim 26, Gersho et al. (154) teach, a code for determining the characteristics of the 
audio signal (LP coding, col. 8 lines 54- a code for adjustment the parameter based on the 
characteristics of the audio signal for providing an adjusted representation of the parameter, 
wherein said adjusting comprises the steps of segmenting the audio signal into a plurality of 
segments based on the characteristics of the audio signal and encoding the segments based on 
one or more of a plurality of encoding settings (LP coding, modified residual, adjusts frames, 
Abstract and Fig. 9; col. 8 lines 54-63). 

As to claim 30, Gersho et al. (154) teaches a mobile terminal (mobile base station, col. 6, 
lines 17-18). 

As to claim 31, Gersho et al. (154) teaches implementing in a cell phone system which 
necessarily has both base station and mobile station adapted to communicating with the base 
stations (col. 6, lines 33-36); a decoder for use in parametric audio coding for generating a 
synthesized audio signal indicative of an audio signal having audio characteristics, wherein the 
audio signal is coded in a coding step into a plurality of parameters at a data rate and the 
encoding step is adjusted based on the characteristics of the audio signal for providing an 
adjusted representation of the parameters, wherein the said adjusting comprises the steps of 
segmenting the audio signal into a plurality of segments based on the characteristics of the audio 
signal and encoding the segments based on one or more of a plurality of encoding settings (Figs 
1, 4-5, LP coding, modified residual, adjusts frames. Abstract and Fig. 9; col. 8 lines 54-63 ).. 
an input for receiving audio data indicative of the parameters in the adjusted representation from 
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at least one of the base stations for providing the audio data to the decoder, so as to allow the 
decoder to generate the synthesized audio signal based on the adjusted representation (Figs 1, 4- 
5, col. 3 lines 1-15). 

As to claim 32, Gersho et al. (154) teach, an input for receiving audio data indicative of 
end points defining a plurality of sub-segments, wherein the audio signal is encoded for 
providing parameters indicative of the audio signal, the parameters including pitch contour data 
containing a plurality of pitch values representative of an audio segment in time, and wherein the 
pitch contour data in the audio segment in time is approximated by a plurality of consecutive 
sub-segments in the audio segment, and wherein the end points include a first end point and a 
second en d point for defining each of said sub-segments (decoder, col. 6 lines 8-1 1 and Fig. 1); 
and a reconstruction module for reconstructing the audio segment based on the received audio 
data (Fig. 9; col. 6 lines 8-11). 

As to claim 33, Gersho et al. (154) teach encoding settings inherently include bit 
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 1 1 lines 4-16), quantization 
method (col. 1 1 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60). 

As to claim 34, Gersho et al. (154) teach, the audio signal contains sinusoidal 
components (col. 3 lines 25-29, analysis windows made equal becomes sine) and said parameters 
include fi-equency values (Fig. 1 element 68), amplitude values (col. 3 lines 51-55) and phase 
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values indicative of the sinusoidal components (Fig. 1 element 76 and col. 3 lines 25-29). 

As to claim 35, Gersho et al. (154) teach the parameters includes pitch (col. 4 line 60), 
voicing f(Fig. 9 element 42c), amplitude (col. 3 lines 51-55) and energy of the audio signal (col. 
3 lines 42-44). 

As to claim 36, Gersho et al. (154) teach the parameters include pitch contour data (col. 4 
line 60-61) containing a plurality of pitch values inherently representative of an audio segment in 
time (col. 4 lines 59-63 and col. 2 lines 51-64). 

As to claim 37, Gersho et al. (154) teach encoding settings inherently include bit 
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 1 1 lines 4-16), quantization 
method (col. 1 1 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60, Fig. 4, 8-9 
and 14). 

As to claim 40, Gersho et al. (154)teach encoding settings inherently include bit 
allocation (col. 3 lines 45-49), quantization accuracy (Fig. 5 and col. 11 lines 4-16), quantization 
method (col. 1 1 lines 4-16) and parameter update rate (col. 3 lines 31-44 and 56-60, col. 6 lines 
8-11). 

As to claim 41, Gersho et al. (154)teach, wherein the audio signal comprises a plurality of 
frames and the audio signal in each frame has a waveform and wherein the fiirther audio signal is 
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produced in the decoding stage independently of the waveform (col. 14 lines 8-14; col. 13 lines 
62-67 and col. 14 lines 1-7). 

As to claim 42, which depends on claim 1, Gersho et al. (154) teach 

wherein each segment has a segment length and wherein the segment length of at least one 
segment is different from the segment length of at least one other segment (col. 14 lines 8-14; 
col. 13 lines 62-67 and col. 14 lines 1-7). 

As to claim 43, which depends on claim 19, Gersho et al. (154) teach wherein the audio 
signal comprises a plurality of frames and the audio signal in each frame has a waveform and 
wherein the module generates the fiirther audio signal independently of the waveform (col. 14 
lines 8-14, col. 13 lines 62-67 and col. 14 lines 1-7). 
.14). 

As to claim 44, which depends on claim 19, Gersho et al. (154) teach 
wherein the segments comprise segments of different segment lengths (col. 14 lines 8-14). 

As to claim 46, which depends on claim 26, Gersho et al. (154) teach 
wherein the segments comprise segments of different segment lengths (col. 14 lines 8- 
14). 



As to claim 47, which depends on claim 31, Gersho et al. (154) teach 



Application/Control Number: 1 0/692,290 Page 1 1 

Art Unit: 2626 

wherein the segments comprise segments of different segment lengths (col. 14 lines 8- 
14). 

As to claim 48, which depends on claim 32, Gersho et al. (154) teach 
wherein the segments comprise segments of different segment lengths (col. 14 lines 8- 
14). 



4. The following is a quotation of the appropriate paragraphs of 35 U.S. C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 



5. Claims 15-18,22-25,38,45 are rejected under 35 U.S.C. 102(e) as being anticipated by 

Sinhaetal (7191 1361 

As per claims 15,22,23,45, Sinha et al ("7191 136^ teaches a method for use in a 
parametric audio coding to encode an audio signal by segmenting the audio signal into a 
plurality of segments based on audio characteristics of the audio signal (by high pass 
filtering the input audio signal (col. 4 lines 47-51), and then performing a non-linear 
parametric representation of the signal - col. 4 lines 53-59; wherein the data amount per 
processing depends upon the fi-equency characteristics of the audio signal, and the 
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characteristics analyzed can be peak analysis, lattice quantization, or frequency range 
selection - col. 3 lines 1-6); encoding the segments with different encoding settings (by 
choosing compression settings on-the-fly col. 6 lines 43-47); also teaching upsampling 
(col. 7 lines 42-44) or downsampling (col. 7 lines 39-46). 

As per claims 16-18,38, Sinha et aK7191 136) teaches quantized and unquantized 
features (col. 3 lines 1-6) 

As per claim 24, Sinha et aK7191 136) teaches storage mediums (col. 8 lines 1-9). 

As per claim 25, Sinha etal (7191136) teaches header information transmitted 
over communication channels (col. 6 line 64 - col. 7 line 7). 

Response to Arguments 

6. Applicant's arguments filed 12/3/07 have been fully considered but are unpersuasive. . 
As per the arguments against the Gersho reference, examiner argues that the current claim scope 
still reads upon the Gersho reference, as detailed in the 35 U.S.C. 1 12 reference above. The 
claim scope has been interpreted according to the comments listed above under the 35 U.S.C. 
1 12 rejection, and under that interpretation, the Gersho reference still applies. As per applicants 
arguments against the 1 12 rejection, examiner disagrees and argues that the reference to page 15 
of the specification shows a tj^ical speech coder that the estimation of speech parameters at 
regular intervals reads on the broad claim scope pertaining to segmentation. Clearly, based upon 
this interpretation of the broad claim scope, the Gersho reference still applies to the claims. 
Furthermore, examiner recommends that applicants also revisit the references cited in the 
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previous office action that could apply to applicants interpretation of the claim scope (that 

section is provided as follows: 

The amendment filed 12/3/07 has canceled the structural relationships that would 
have defined apphcants disclosed invention, namely, the combination of subblocks 12, 
20, and signal flowpath 110 and 112. However, the broad current claim scope still reads 
upon Fig. 4, subblock 12, and the disclosed encoder/extraction process detailed in 
applicants specification with respect to this aspect of applicants disclosure. Furthermore, 
examiner notes that the process in applicant's disclosure depends upon an initial 
partitioning of the audio signal - the parameters that are extracted from subblock contain 
not only voicing/energy/spectral characteristics, but timing information as well, such 
timing information used in subblock 20 to determine the final sectioning of the audio 
signal. If this timing-partitioning information was not available to subblock 20, then the 
decision making process of subblock 20 would be ineffective. Lastly, examiner wishes to 
point to applicant the following prior art - Teguchi (4701955), teaching variable frame 
length vocoder - determining the frame length based on spectral parameters of the audio 
signal (col. 4 lines 29-60; col. 3 lines 40-60); and Manjunath (6434519), Fig.5, 
performing energy parameter analysis on received samples, and performing different 
encoding methods based on different encoding rates (different encoding rate with 
different size frames); Stachurski et al (7039581) applying different mode encoding 
based upon exfracted parameters (these references can be applied to different aspects of 
applicants disclosure). 
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As per the arguments against the Sinha reference, examiner disagrees and argues that the 
segmentation (data set size) is directly dependent upon the frequency characteristics of the 
signal, and as such, still reads on the current broad claim scope. 



Conclusion 



7. THIS ACTION IS MADE FINAL. Apphcant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 



8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael Opsasnick, telephone number (571)272-7623, 
who is available Tuesday-Thiirsday, 9am-4pm. 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, Mr. Richemond Dorvil, can be reached at (571)272-7602. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



/Michael N. Opsasnick/ 
Primary Examiner, Art Unit 2626 

9/1/08 



